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^ Abstract 



We study a problem motivated by a question related to quantum- 
error-correcting codes. Combinatorially, it involves the follo"wing graph 
parameter: 



o 
u 

f{G) =mm{\A\ + \{xeV\A:dA{x)is odd}| : ^ / 0} , 



I I where V is the vertex set of G and d^ix) is the number of neighbors 

of X in A. We give asymptotically tight estimates of / for the random 
^ graph Gn,p "when p is constant. Also, if 

O /(n)=max{/(G): \ViG)\=n} 

<N then we show that f{n) < (0.382 + o(l))n. 

rn 
O 

§ 1 Introduction 

> 

•r-H In this paper we consider a problem which is motivated by a question from 

quantum-error-correcting codes. To see how to use graphs to construct quantum- 
d error-correcting codes see, e.g., 

Given a graph G with ±1 signs on vertices, each vertex can perform at 
most one of the following three operations: Oi (flip all of its neighbors, i.e., 
change their signs), O2 (flip itself), and O3 (flip itself and all of its neighbors). 
We want to start with all +l's, execute some non-zero number of operations 
and return to all -|-l's. The diagonal distance f{G) is the minimum number 
of operations needed (with each vertex doing at most one operation). 
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Figure 1: The behavior of f{p) = hm„^oo f{Gn,p)/n as a function of p. 
Trivially, 

/(G) < 5{G) + 1 (1) 

holds, where 5{G) denotes the minimum degree. Indeed, a vertex with the 
minimum degree applies Oi and then its neighbors fix themselves applying 
O2. Let 

f{n) = max f{G), 

where the maximum is taken over all non-empty graphs of order n. Shiang 
Yong Looi (personal communication) asked for a good approximation on f{n). 

In this paper we asymptotically determine the diagonal distance of the 
random graph Gn,p for any p G (0, 1). 

We denote the symmetric difference of two sets A and B hj A A B and 
the logarithmic function with base e as log. 

Theorem 1.1 There are absolute constants Aq ~ 0.189 and pq ^ 0.894, 
, such that for G = Gn,p asymptotically almost surely: 

(t) f{G) = 6{G) + 1 forO <p< Xo orp = o{l), 
(11) \f{G) - \,n\ = 0(^1/2) for \,<p< po, 

(Hi) f{G) = 2 + min^,yey(G) |(A^(x) A N{y)) \ {x,y}\ for Pq < p < 1 or p = 

1-0(1). 

(Here 0{n^^'^) hides a polylog factor). 

Figure [1] visualizes the behavior of the diagonal distance of G^.p- In addition 
to Theorem |1.1 1 we find the following upper bound on f{n). 

Theorem 1.2 f{n) < (0.382 + o(l))n. 

In the remainder of the paper we will use a more convenient restatement 
of f{G). Observe that the order of execution of operations does not affect the 
final outcome. For any A G V = V{G), let B consist of those vertices in l^\y4 
that have odd number of neighbors in A. Let a = \A\ and b = \B\. We want to 
minimize a + b over all non-empty A C V{G). The vertices of A do an O1/O3 
operation, depending on the even/odd parity of their neighborhood in A. The 
vertices in B then do an 02-operation to change back to +1. 



see 



m and (12 
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2 Random Graphs for p = 1/2 

Here we prove a special case of Theorem when p = 1/2. This case is 
somewhat easier to handle. 

Let G = Gn,i/2 be a binomial random graph. First we find a lower bound 
on f{G). If we choose a non-empty A G V and then generate G, then the 
distribution of b is binomial with parameters n — a and 1/2, which we denote 
here by Bin{n — a, 1/2). Hence, if / is such that 

^ ( ^ ) Pr {Bm{n - a, 1/2) </ - 1 - a) = o(l), (2) 

a=l 

then asymptotically almost surely the diagonal distance of G is at least /. 
Let \ = l/n and a = a/n. We can approximate the summand in ^ by 

2n{H{a) + il-a){H{^)-l)+0(logn/n)) ^ (^^^ 

where H is the binary entropy function defined as H{p) = —plog2P — (1 — 
p) log2(l — p)- For more information about the entropy function and its prop- 
erties see, e.g., [1]. Let 

g,ia) = H{a) + (1 - a) (^H - 1^ . (4) 

The maximum of gx{a) is attained exactly for a = 2 A/3, since 

2(A-a) 

9\{a) = log2 • 

a 

Now the function 

hiX) = gxi2X/3) (5) 

is concave on A G [0, 1] since 

^"(^' = (A-l!A.og2 < °- 

Moreover, observe that h{0) = -1 and h{l) = H{2/3) - 1/3 > 0. Thus the 
equation h{X) = has a unique solution Aq and one can compute that 

Ao = 0.1892896249152306 ... (6) 

Therefore, if A = Aq — Klogn/n for large enough i^' > 0, then the left 
hand side of ^ goes to zero and similarly for A = Aq + K\ogn/n it goes to 
infinity. In particular, f{G) > (Aq — o(l))n asymptotically almost surely. 
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Let us show that this constant Aq is best possible, i.e., asymptotically 
almost surely f{G) < (Aq + K\ogn/n)n. Let A = Aq + K\ogn/n, n be large, 
and / = An. Let « = 2A/3 and a = [anj . We pick a random a-set A G V and 
compute b. Let Xa be an indicator random variable so that = 1 if and 
only if 6 = b{A) < I - a. Let X = E|A|=a ^a- We succeed if X > 0. 

The expectation E{X) = {'^) Pr {Bin{ n — a, 1/2) <l — a) tends to infinity, 
by our choice of A. We now show that X > Q asymptotically almost surely by 
using the Chebyshev inequality. First note that for n C 7^ we have 

Cav{XA, Xc) = Pr(X^ = = 1) - Pr(X^ = 1) Pr(Xc = 1) = 0. 

Indeed, if x e F \ (A U C), then Pr(a; G B{A)\Xc = 1) = 1/2, since A \ 
C 7^ and no adjacency between x and all vertices in A \ C is exposed 
by the event Xc — 1. Similarly, if ,t G C \ A, then A fl C 7^ and an 
adjacency between x and A fl C is independent of the occurrence of Xc = 1. 
This implies that Pr(a; G B{A) | = 1) = 1/2 as well. Thus Pt{Xa = 
l\Xc = 1) = Pr (Bin{n — a, 1/2) < I — a) = Pr(X^ = 1), and consequently, 
Cov{Xa,Xc)^0. 

Now consider the case when A D C = ^. Let s be a vertex in A. Define 
a new indicator random variable Y which takes the value 1 if and only if 
\B{C)\{s]\<l - a. Observe that 

Pr(r = 1) = Pr {Bin{n - a - 1, 1/2) < / - a) 

< 2 Pr {Bin{n - a, 1/2) < / - a) = 2 Vy{Xa = 1). 

Moreover, 

Pr(XA = l\Y = 1) = Pr {Bin{n - a, 1/2) < Z - a) = Pr(XA = 1), 

since for every x & V \ A the adjacency between x and s is not influenced 
by y = 1. Finally note that Xc < Y. Thus, 

Cov{Xa, Xc) < Pt{Xa = Xc = I) < Pt{Xa = Y = 1) 

= Pr{Y = 1) Fr{XA = 1|F = 1) < 2 (Pr(X^ = 1))^ . 

Consequently, 

Var{X)^E{X)+ ^ Cov{Xa,Xc)+ ^ Cov{Xa,Xc) 

AnCj^0,A¥=C AnC=$ 

<E{X)+2 J2 (Pr(^A = l))' 

Anc=0 

= EiX) + 2Q - (Pr(X^ = 1))^ = o (£;(X)^) , 

as E{X) = {1)Ft{Xa = 1) tends to infinity and (";") = o((^)). Hence, 
Chebyshev's inequality yields that X > asymptotically almost surely. 
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Remark 2.1 A version of the well-known Gilbert- Varshamov bound (see, 
e.g., J3]) states that if 

i-i , . 

2-"^r 3^<1, (7) 
i=\ ^ ' 

then f{n) > I. Observe that this is consistent with bound ([2]). Let A = l/n. 
We can approximate the left hand side of ([T]) by 

2n.(/^(A)+Alog2 3-l+o(l)) 

One can check after some computation that 

H{X) + X\og^3-l = gxi2X/3). 
Therefore, ^ and ([T]) give asymptotically the same lower bound on f{n). 



3 Random Graphs for Arbitrary p 

Let G = Gn,p be a random graph with constant p G (0, 1). 

Observe that for a fixed set A C V, |A| = a, the probability that a vertex 
from V \A belongs to -6(^4) is 

0<j<f ^ ^ 

(If this is unfamiliar, expand (1 — 2p)" as ((1 — p) — p)"' and compare). 



3.1 0<j9<Ao 

For p < Ao we begin with the upper bound f{G) < 5{G) + 1, see ([T|. For the 
lower bound it is enough to show that 

I \ Ft {Bin{n — a,p{a)) < pn — a) = o{l), (8) 

2<a<pn 

since S{G) + 1 < np asymptotically almost surely. (We may assume that 
p = otherwise S{G) = with high probability and the theorem is 

trivially true.) This implies that if 1^41 + \B\ < pn, then 1^41 = 1. 
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3.1.1 p Constant 

We split this sum into two sums for 2 < a < i/n and i/n < a < pn, respec- 
tively. Let X = Bin{n — a,p{a)) and 

pn — a p 1 

e = l- — — - > 1 - = 1 > 0. 9 

{n - a)p{a) " p{2) 2 - 2p ^ ^ 

Thus, by Chernoff 's bound, 

PT{Bin{N, p) < (1 - 9)Np) < e~^^^Pl^ (10) 

we see that 

Pr [Bin{n - a,p{a)) < pn - a) = Pr (X < (1 - e)E{X)) 

< exp{-e^E{X)/2} 
= exp{-0(ra)}, 

and consequently, 

I \ Pt {Bin{n — a,p{a)) < pn — a) < ^/n( j exp{— 0(n)} 

2<a<v^ VV 



< exp{0(vn logn)} exp{— 9(?7.)} 

= 0(1). 



Now we bound the second sum corresponding to ^/n < a < pn. Note that 



^ ^ Pi {Bin{n — a, p{a)) < pn — a) 

y/n<.a<pn 

= Q (^^^ (n-a^- + ©(e-^^"'''))") < pn - a") 

< ^2"'^(p)+o(l) = o(l). 

Here is defined in ([s]) and the right hand limit is zero since p < \q. 
3.1.2 p = o(l) 

We follow basically the same strategy as above and show that ([s]) holds for 
large a and something similar when a is small. Suppose then that p = 1/uj 
where u = uj{n) oo. First consider those a for which ap > Xjuj^l"^ . In this 
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case p{a) > (1 - e-^''P)/2. Thus, 

( \ Py {Bin{n — a,p{a)) < pn — a) 

^1 / .1 /2 V^/ 



ap>l/ijl/2 
a<nj) 



^ gO(n log a;/a;)g-n(n/a;l/2) ^ ^^^^^^^ 



ap>l/ajl/2 
a<np 



If ap < Xjuj^l'^ then p(a) = ap(l + 0(ap)). Then 
I j Pr (i?m(n — a, p(a)) < pra — a) 

, .1/9 V*^/ 



ap<l/a;l/2 
2<a<np 



< (^e--A°)'^ = o(l) 



'11^ 



ap<l/Lji/2 
2<a<np 



provided np > lllogn. 

If np < log n — log log n then G = Gn,p has isolated vertices asymptotically 
almost surely and then f{G) = 1. So we are left with the case where logra — 
log log n < np < 11 log n. 

We next observe that if there is a set A for which 2 < |A| and |A| + |i?(74)| < 
np then there is a minimal size such set. Let Ha = {A, Ea) be a graph with 
vertex set A and an edge {v,w) G Ea if and only if v,w have a common 
neighbor in G. Ha must be connected, else A is not minimal. So we can find 
t < a — 1 vertices T such that A U T spans at least t + a — 1 edges between A 



and T. Thus we can replace the estimate (11) by 



ap<l/a;i/2 *=! 
2<a<np 



2<a<np 



ap<l/Lji/2 
2<a<?ip 

3.2 < 1? < 1 

First let us define the constant p^. Let 

Po ~ 0.8941512242051071... (12) 
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be a root of 2p — = Aq. For the upper bound let A — {x,y}, where 

X and y satisfy \N{x) A N{y)\ < \N{x') A N{y')\ for any x',y' G V{G). 
Then B = B{A) = N{x) A N{y), and thus, asymptotically almost surely 
\B\ < {2p — 2p'^)n plus a negligible error term o{n). (We may assume that 
1 —p — Q. (^^^) ; for otherwise we have two vertices of degree n — 1 with high 
probability, and hence, f{G)—2.) 

To show the lower bound it is enough to prove that 



E 

3<a<(2p-2p2)n 



\ Pr {^Bin{n — a,p{a)) < {2p — 2p^)n — a) = o(l). 



Indeed, this implies that if \A\ + \B\ < {2p - 2p'^)n, then \A\ = 1 or 2. But if 
1^41 = 1, then in a typical graph \B\ — {p + o{l))n > {2p — 2p^)n since p > 1/2. 

3.2.1 p Constant 

As in the previous section we split the sum into two sums for 3 < a < ^/n and 
\/n < a < pn, respectively. Let 

{2p-2f)n-a 2p-2p' 
[n — a)p{a) p[a) 

To confirm the second inequality we have to consider two cases. The first one 
is for a odd and at least 3. Here, 

2p-2p2 2p-2p2 

'-^>'-^ = ^^''-'^ 

The second case, for a even and at least 4, gives 

p{a) p{2) 
Now one can apply Chernoff bounds with the given e to show that 

Y \] Pr {Bin{n - a,p{a)) < {2p - 2p^)n - a) = o(l). 

Now we bound the second sum corresponding to ^/n < a <{2p — 2p'^)n. Note 
that 

I j Pr [Bin{n — a,p{a)) < {2p — 2p^)n — a) 
Y [Bin (n-a,- + 0(e-^("'^'))'] < {2p - 2p^)n - a) 

v^<a<(2p-2p2)n ^ ^ ^ ^ 

< ^2'^'^(2p-2p2)+o(l) _ 

since p > po implies that 2p — 2p^ < Aq. 
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3.2.2 p=l-o(l) 

One can check it by following the same strategy as above and in Section 



3.1.2 



3.3 Xo<p<Po 

Let a = 2Ao/3, a = [an\ . Fix an a-set A C V and generate our random graph 
and determine B = B{A) with b = \B\. Let e = [logn)'^/ ^/n and let Xa be 



the indicator random variable for a + b < (Aq + e)n and X = YIa-^a- Then 

p{a) = 1 + e-^(") 

and with gx{a) as defined in Q, 

E(X) =exp{((7Ao+.(2Ao/3) + o(l))nlog2}. (13) 

Now 



1 — a / \1 — a 



gx{a) + e\og^ [ |— ^ ) + Oie''' 



Plugging this into (13) with A = Aq and a = 2Xq/3 we see that 

E{X) = exp I f £log2 ( ^^^) + 0{e')] n\og2\ = e^(('-^<^'''\ (14) 



Ao/3 

Next, we estimate the variance of X. We will argue that for A,C G (^) 
either |y4 A C| is small (but the number of such pairs is small) or 1^4 A C| is 
large (but then the covariance Cov{Xa, Xc) is very small since if we fix the 
adjacency of some vertex x to C, then the parity of \N{x) n{A\C)\ is almost 
a fair coin flip). Formally, 

Var{X) =E{X) + Ea^c Cov{Xa, Xc) 

<E{X) + E|^AC|<2v/^Pr(^^ = ^c = l) 

+ Z]|ylAC|>2v^,|AnC|>v^C'0f (Xa,Xc) 

+ E\Anc\<V^P<XA = Xc = l). 



Since E{X) goes to infinity, clearly E{X) = o(-E'(X)^). We show in Claims 3.1 



3.2 and 3.3 that the remaining part is also bounded by o{E{X)'^). Then 



Chebyshev's inequality will imply that X > asymptotically almost surely. 
Claim 3.1 E\AAc\<2Vn^<XA = Xc = l)= o{E{Xf) 
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Proof. We estimate trivially Pt{Xa = Xq = I) < Pt{Xa = 1). Then, 

1) 



\AAC\<2^ ^ ^ 0<j<v^ ^ ^ ^ ^ 



0<j<v^ 

Thus, ^ yields that E|aac|<2v/;i P^^^i = Xc = 1) = o{E{Xf). □ 

Claim 3.2 E|AAC|>2v/^,|Anc|>v^ ^«^(^a, Xc) = o{E{Xf) 

Proof. If X e V \ {AU C), then Pr(x G 5(A)|Xc = 1) = 2-i+°(i/"), since 
we can always find at least ^/n vertices in A \ C with no adjacency with x 
determined by the event Xc = 1. Similarly, if x G C \ A, then there are at 
least ^/n — 1 vertices in ACiC such that their adjacency with x is independent 
of the occurrence of Xc = 1. This implies that 

Pr(X^ = l|Xc = 1) = (''7'')2-^""'^)+°(') = 2°«Pr(XA = l), 

and consequently, Cov{Xa, Xc) = o(Pr(X^ = 1)^). Hence, 

CoviXA,Xc) < (^^ (Pr(X^ = If) = o{E{Xf). 



□ 



Claim 3.3 E|Anc|<v^i ^^(^^ = Xc = I) = o{E{Xf 



Proof. First let us estimate the number of ordered pairs {A, C) for which 
\AnC\< v^. Note, 



E 1- E 



n\ s-^ n — a\ a 



a I ^ — ' \a — I 

|AnC|<v^ 0<i<v^ 

n\ f n — a\ f a 
a / \ \/n 



^ ^n{H(a)+H{j^){l-a)+o(l)) _ ^^^^^ 

Now we will bound Pr(XA = Xc = 1) for fixed a-sets A and C. Let S G A\C 
be a set of size s = \S\ = [v^J • Define a new indicator random variable Y 
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which takes the value 1 if and only if \B{C) \ S"! < (Aq + e)n — a. Clearly, 
Xc<Y and 

Fr(Y = 1) = Pr (Bin{n — a — s,p{a)) < (Aq + £)n — a) 



0<i< (Ao+e)n— a 

= 2^+°W Pt{Xa = 1), 



n — a 



2^—{n—a) 



Now if we condition on the existence or otherwise of all edges F' between C 
andV\S then ii x E V \ A 



Pr(x G B{A) I F' and F") e 



1 - (1 - 2py 1 + (1 - 2py 



2 ' 2 

where F" is the set of edges between x and A \ S. This imphes that 

|2-(n-a)+o(l) 



n — a 
i 



Ft{Xa = i|y = 1) = Yl 

0<i<{\o+e)n~a 

= 2°« Pr(X^ = l), 

Consequently, 

Pt{Xa = Xc = 1)< Pt{Xa = Y = 1)< 2v^+°(i) Pr(XA = 1)^ 
Hence, (jlsj) implies 



J2 Pr(XA = Xc, = 1) < 2K^H+^(i^)(i-")+°(i)) Pr(X^ = l)^. 
|Anc|<v^ 

To complete the proof it is enough to note that 

E{Xy = 2"(2J^W+o(i)) pr(x^ = 1)2 

and 



2H(a) > H(a) + H 



a 



1 — a 



'I -a). 



Indeed, the last inequality follows from the strict concavity of the entropy 
function, since then {1 ~ a)H [j^) + aH{0) < H{a) with the equality for 
a = only. □ 

Now we show that f{Gn,p) > (Aq — e)n. We show that 

Pr {Bin{n — a,p{a)) < (Aq — e)n — a) = o(l). 



E 
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As in previous sections we split this sum into two sums but this time we make 
the break into 1 < a < (logn)^ and (logn)^ < a < (Aq — e)n, respectively. 
In order to estimate the first sum we use the Chernoff bounds with deviation 
1 — 6 from the mean where 

_ ^_ (Aq -e)n-a ^ ^_ Ap - g ^ ^ _ Xp - e _ e_ 
{n — a)p{a) ~ p{a) ~ Aq Aq 

Consequently, 

( ) Pr (Binin — a,p(a)) < (Aq — e)n — a) 



2<a<(logn) 



<(logn)^( ^ ] exp{-Q(logn)^} 
V(logn)V 

< exp{-^](-(logr^)^)} = o(l). 
Now we bound the second sum corresponding to log log n < a < (Aq — e)n. 

I j Pr {Bin{n — a,p{a)) < (Aq — s)n — a) 

log log n<a<(Ao— £)n 

^ 2n{h{\o~e)+o{l/n)) ^ 

4 General Graphs 



Here we present the proof of Theorem 1.2 First, we prove a weaker result 
f{n) < (0.440... + o(l))n. 

Suppose we aim at showing that f{n) < An. We fix some a and p and let 
a = an and r = pn. For each a-set A let R{A) consist of all sets that have 
Hamming distance at most r from B{A). If 

Q = 2"(^(")+^('')+°(i» > 2^ (16) 

then there are A, A' such that R{A) fl R{A') 9 C is non-empty. This means 
that C is within Hamming distance r from both B = B{A) and B' = B{A'). 
Thus \B AB'\ < 2r. 

Let all vertices in A" = A A A' flip their neighbors, i.e., execute operation 
Oi. The only vertices outside of A" that can have an odd number of neighbors 
in A" are restricted to {B A B') U{An A'). Thus 

f{G) <\AAA'\ + \{B A B') U (A n A')| < 2a + 2r = 2n{a + p). (17) 
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Consequently, we try to minimise a + p subject to H{a) + H{p) > 1. Since the 
entropy function is strictly concave, the optimum satisfies a = p, otherwise 
replacing each of a,p by (a + p)/2 we strictly increase H{a) + H{p) without 
changing the sum. Hence, the optimum choice is 

a = p^ 0.11002786443835959 . . . 

the smaller root of H{x) = 1/2, proving that f{n) < (0.440 . . . + o{l))n. 



In order to obtain a better constant we modify the approach taken in (16). 
Let us take 6 = 0.275, a = 0.0535, a = \_an\ , d = [6n\ . Look at the collection 
of sets B{A), A e ('"'). This gives (^) = 2"(-f^(")+"(i)) binary n-vectors. 

We claim that some two of these vectors are at distance at most d. If not, 
then inequality (5.4.1) in [3] says that 

H{a) + o(l) < min{l + g{u^) - g{u^ + 25u + 25) : Q < u < I - 25}, 



where g{x) = H{{1 — \/l — x)/2). In particular, if we take u = 1 — 25 = 0.45, 
we get 0.30108 + o(l) < 0.30103, a contradiction. 

Thus, we can find two different a-sets A and A' such that \B{A) AB{A') \ < 



d. As in (17), we can conclude that /(G) <2a + d< (0.382 + o{l))n. 
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